Alignment-free Sequence Comparison for Biologically Realistic Sequences of Moderate Length
Authors
Abstract
Similar resources
Alignment-free sequence comparison for biologically realistic sequences of moderate length.
The D(2) statistic, defined as the number of matches of words of some pre-specified length k, is a computationally fast alignment-free measure of biological sequence similarity. However, there is some debate about its suitability for this purpose, as the variability in D(2) may be dominated by terms that reflect only the noise within each individual sequence. We examine the extent of the prob...
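As an illustration of the definition above, the following is a minimal sketch of counting k-word matches between two sequences. It assumes the common reading of D(2) as the sum, over all words of length k, of the product of that word's counts in each sequence. The function names `kmer_counts` and `d2` and the example sequences are hypothetical and do not come from the paper.

```python
from collections import Counter


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping word of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(seq_a: str, seq_b: str, k: int) -> int:
    """Number of k-word matches between seq_a and seq_b.

    Assumption: a match is any pair of positions (i, j) whose
    k-words are identical, so the total equals the sum over words
    of count_in_a * count_in_b.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    # Counter returns 0 for words that are absent from seq_b.
    return sum(n * counts_b[word] for word, n in counts_a.items())


if __name__ == "__main__":
    # Hypothetical toy sequences, for illustration only.
    print(d2("ACGTACGT", "ACGTTTGA", 2))
```

Counting words once per sequence keeps the cost roughly linear in the total sequence length, which is consistent with the abstract's description of the statistic as computationally fast.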
Multiple alignment-free sequence comparison
MOTIVATION: Recently, a range of new statistics have become available for the alignment-free comparison of two sequences based on k-tuple word content. Here, we extend these statistics to the simultaneous comparison of more than two sequences. Our suite of statistics contains, first, C(*)1 and C(S)1, extensions of statistics for pairwise comparison of the joint k-tuple content of all the sequenc...
Biologically Relevant Multiple Sequence Alignment
Hyrum D. Carroll, Department of Computer Science, Doctor of Philosophy. Researchers use multiple sequence alignment algorithms to detect conserved regions in genetic sequences and to identify drug docking sites for drug development. In this dissertation, a novel algorithm is presented for using physicochemical properties to increase the accuracy of...
An alignment-free model for comparison of regulatory sequences
MOTIVATION: Some recent comparative studies have revealed that regulatory regions can retain function over large evolutionary distances, even though the DNA sequences are divergent and difficult to align. It is also known that such enhancers can drive very similar expression patterns. This poses a challenge for the in silico detection of biologically related sequences, as they can only be discov...
A statistical method for alignment-free comparison of regulatory sequences
MOTIVATION: The similarity of two biological sequences has traditionally been assessed within the well-established framework of alignment. Here we focus on the task of identifying functional relationships between cis-regulatory sequences that are non-orthologous or greatly diverged. 'Alignment-free' measures of sequence similarity are required in this regime. RESULTS: We investigate the use of ...
Journal
Journal title: Statistical Applications in Genetics and Molecular Biology
Year: 2011
ISSN: 1544-6115
DOI: 10.2202/1544-6115.1724